Search CORE

31 research outputs found

De Novo DNA Assembly with a Genetic Algorithm Finds Accurate Genomes Even with Suboptimal Fitness

Author: A Nebro
C Ip
CS Chin
E Alba
K Bradnam
KJ Räihä
MS Poptsova
R Parsons
RJ Parsons
RL Warren
SL Salzberg
Y Cherukuri
Publication venue: Springer
Publication date: 01/04/2017
Field of study

Crossref

University of Twente Research Information

AST: An Automated Sequence-Sampling Method for Improving the Taxonomic Diversity of Gene Phylogenetic Trees

Author: A Dereeper
A Loytynoja
A Wehe
AR Nabhan
B Rannala
BG Hall
C Chauve
C Notredame
C Zhou
Chan Zhou
CR Linder
DA Benson
DJ Zwickl
DM Hillis
DT Jones
F Jacobsen
F Plazzi
F Ronquist
F Ronquist
Fenglou Mao
J Kim
J Pecon-Slattery
JA Eisen
Jinling Huang
Johann Peter Gogarten
JP Jenuth
JP Townsend
JP Townsend
K Katoh
K Katoh
K Liu
K Liu
K Tamura
KA Cranston
KB Li
KS Pick
L Liu
L Liu
M Poptsova
MN Price
MS Poptsova
MS Rosenberg
MS Rosenberg
N Lartillot
O Gascuel
Paul Jaak Janssen
PD Faith
RC Edgar
RD Page
RI Vane-Wright
S Guindon
S Guindon
S Nelesen
S Whelan
SF Altschul
T Frickey
Y Yin
Y Yin
Yanbin Yin
Ying Xu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php

Crossref

Directory of Open Access Journals

PubMed Central

The University of North Carolina at Greensboro

ScholarShip

FigShare

EnzymeDetector: an integrated enzyme function prediction tool and database

Author: A Chang
AK Arakaki
AM Schnoes
C Bannert
C Camacho
C Claudel-Renard
C Médigue
D Frishman
D Soh
Dietmar Schomburg
GL Winsor
M Chitale
M Kanehisa
M Kanehisa
M Kanehisa
MC Walter
ML Riley
MS Poptsova
N Furnham
PA Fujita
Q She
R Caspi
S Misra
Susanne Quester
W Tian
Y Yang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Genome Majority Vote Improves Gene Predictions

Author: A Pallejà
A Pati
AE Tenney
AL Delcher
Christos A. Ouzounis
D Hyatt
D Vallenet
DP Herlemann
G Parra
I Korf
J Besemer
J Dunbar
John Dunbar
Judith D. Cohn
KE Rudd
M Alexandersson
M Dai
M Riley
M Walker
Michael E. Wall
MR Brent
MS Poptsova
P Flicek
R Guigó
RC Edgar
RG Skophammer
RK Aziz
SF Altschul
Sindhu Raghavan
SS Gross
SS Gross
WJ Bruno
WJ Bruno
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

Methods for detection of horizontal transfer of transposable elements in complete genomes

Author: Ashburner M
Azad RK
Azad RK
Becq J
Biémont C
Capy P
Clark JB
Cock PJ
Daniels S
Dupuy C
Elgion L.S. Loreto
Fall S
Flutre T
Gal-Mor O
Gilbert C
Gilbert C
Guindon S
Gurudatta B
Juhas M
Keeling PJ
Kim AC
Knight R
Koski LB
Lyubetsky VA
Marcos Oliveira de Carvalho
Marri PR
Medrano-Soto A
O'Brochta DA
Passel M
Plessis L
Podell S
Poptsova MS
Putonti C
Ragan MA
Ragan MA
Rocha E
Schaack S
Shi S-Y
Silva JC
Supek F
Vasconcelos ATR
Vernikos GS
Wang H
Wei X
Weinert LA
Zaneveld JR
Zhou Q
Publication venue: 'FapUNIFESP (SciELO)'
Publication date: 01/01/2012
Field of study

Crossref

eCAMBer: efficient support for large-scale comparative analysis of multiple bacterial strains

Author: A Palleja
A Pati
A Roetzer
C Camacho
CR Laing
D Hyatt
D Kim
D Vallenet
DE Wood
EJ Richardson
J Dunbar
J Zhou
J-F Yu
Jerzy Tiuryn
JJ Gillespie
JL Klassen
Limsoon Wong
M Touchon
M Wozniak
M Wozniak
M Wozniak
ME Wall
Michal Wozniak
MS Poptsova
NJ Loman
NM Daniels
P Fournier
P-R Loh
PD Karp
PJA Cock
S Kasif
SP Shah
SV Angiuoli
SV Angiuoli
T Yada
THA Ederveen
V Pavlović
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Blueprint for a minimal photoautotrophic cell: conserved and variable genes in Synechococcus elongatus PCC 7942

Author: A Danchin
A Dufresne
A Dufresne
A Moya
A Tomitani
AN Nikolskaya
Andrés Moya
AY Mulkidjanian
C Sugita
Carmen M González-Domenech
CH Kuo
CK Holtman
E Szathmáry
EC Nowack
EV Koonin
Fernando de la Cruz
G Dong
G Pósfai
GC Kettler
GM Pao
H Tettelin
HA Schmidt
J Castresana
J Felsenstein
JE Stajich
JL Pellequer
Juli Peretó
K Tamura
KW von Nägeli
L Aravind
LB Koski
Luis Delaye
M Breitbart
M Podar
MA Ragan
María P Garcillán-Barcia
MG Langille
ML Coleman
MS Poptsova
NJ Robinson
O Zhaxybayeva
OA Koksharova
PD Karp
PJ Robinson
R Ghai
R Simm
RC Edgar
RD Finn
RDM Page
S Karlin
S Kurtz
S Waack
SF Altschul
SJ Giovannoni
SJ Giovannoni
T Shi
V Daubin
V Kolisnychenko
W Hsiao
WD Swingley
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Background: Simpler biological systems should be easier to understand and to engineer towards pre-defined goals. One way to achieve biological simplicity is through genome minimization. Here we looked for genomic islands in the fresh water cyanobacteria Synechococcus elongatus PCC 7942 (genome size 2.7 Mb) that could be used as targets for deletion. We also looked for conserved genes that might be essential for cell survival.Results: By using a combination of methods we identified 170 xenologs, 136 ORFans and 1401 core genes in the genome of S. elongatus PCC 7942. These represent 6.5%, 5.2% and 53.6% of the annotated genes respectively. We considered that genes in genomic islands could be found if they showed a combination of: a) unusual G+C content; b) unusual phylogenetic similarity; and/or c) a small number of the highly iterated palindrome 1 (HIP1) motif plus an unusual codon usage. The origin of the largest genomic island by horizontal gene transfer (HGT) could be corroborated by lack of coverage among metagenomic sequences from a fresh water microbialite. Evidence is also presented that xenologous genes tend to cluster in operons. Interestingly, most genes coding for proteins with a diguanylate cyclase domain are predicted to be xenologs, suggesting a role for horizontal gene transfer in the evolution of Synechococcus sensory systems.Conclusions: Our estimates of genomic islands in PCC 7942 are larger than those predicted by other published methods like SIGI-HMM. Our results set a guide to non-essential genes in S. elongatus PCC 7942 indicating a path towards the engineering of a model photoautotrophic bacterial cell.Financial support was provided by grants BFU2009-12895-C02-01/BMC (Ministerio de Ciencia e Innovación, Spain), the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement number 212894 and Prometeo/2009/092 (Conselleria d’Educació, Generalitat Valenciana, Spain) to A. Moya. Work in the FdlC laboratory was supported by grants BFU2008-00995/BMC (Spanish Ministry of Education), RD06/0008/1012 (RETICS research network, Instituto de Salud Carlos III, Spanish Ministry of Health) and LSHM-CT- 2005_019023 (European VI Framework Program). Dr. González-Domenech was supported by grant from the University of Granada. LD, thanks to financial support from Facultad de Ciencias, Universidad Nacional Autónoma de México

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

Directory of Open Access Journals

Repositorio Institucional Universidad de Granada

PubMed Central

Phylogenomic Analysis of Marine Roseobacters

Author: A Buchan
A Stamatakis
C Dutta
Carl Kingsford
Cathy H. Wu
CH Wu
CJ Creevey
CM Thomas
D Posada
DF Robinso
E Bapteste
E Lerat
E Susko
F Abascal
G Bouxin
G Talavera
GT Taylor
H Ochman
H Shimodaira
H Shimodaira
HA Schmidt
Hongzhan Huang
I Wagner-Dobler
I Wagner-Dobler
J Bergsten
J Castresana
J Felsenstein
JA Eisen
JD Thompson
JP Gogarten
JP Gogarten
JP Huelsenbeck
JR Brown
Kai Tang
KH Tang
L Li
LM Schouls
MA Moran
MS Poptsova
N Galtier
Nianzhi Jiao
NZ Jiao
O Zhaxybayeva
O Zhaxybayeva
R Jain
R Seshadri
RD Page
RG Beiko
RG Beiko
RL Charlebois
RL Tatusov
RS Poretsky
S Guindon
SF Altschul
SJ Sorensen
SM Sowell
T Brinkhoff
T Shi
TR Miller
V Daubin
VM Markowitz
Y Zhang
Y Zhao
ZS Kolber
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Background: Members of the Roseobacter clade which play a key role in the biogeochemical cycles of the ocean are diverse and abundant, comprising 10–25 % of the bacterioplankton in most marine surface waters. The rapid accumulation of whole-genome sequence data for the Roseobacter clade allows us to obtain a clearer picture of its evolution. Methodology/Principal Findings: In this study about 1,200 likely orthologous protein families were identified from 17 Roseobacter bacteria genomes. Functional annotations for these genes are provided by iProClass. Phylogenetic trees were constructed for each gene using maximum likelihood (ML) and neighbor joining (NJ). Putative organismal phylogenetic trees were built with phylogenomic methods. These trees were compared and analyzed using principal coordinates analysis (PCoA), approximately unbiased (AU) and Shimodaira–Hasegawa (SH) tests. A core set of 694 genes with vertical descent signal that are resistant to horizontal gene transfer (HGT) is used to reconstruct a robust organismal phylogeny. In addition, we also discovered the most likely 109 HGT genes. The core set contains genes that encode ribosomal apparatus, ABC transporters and chaperones often found in the environmental metagenomic and metatranscriptomic data. These genes in the core set are spread out uniformly among the various functional classes and biological processes. Conclusions/Significance: Here we report a new multigene-derived phylogenetic tree of the Roseobacter clade. Of particular interest is the HGT of eleven genes involved in vitamin B12 synthesis as well as key enzynmes fo

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Xiamen University Institutional Repository